Document Processing, Neural OCR, Multilingual Archives, Computational Philology

Feeds to Scour
SubscribedAll
Scoured 9573 posts in 416.8 ms
No Language Left Behind: Scaling Human-Centered Machine Translation
dev.to·1d·
Discuss: DEV
🎙️Whisper
Preview
Report Post
Natural language processing for word sense disambiguation and information extraction
arxiv.org·19h·
Discuss: r/compsci
📥Feed Aggregation
Preview
Report Post
How OCR Impacts the Accuracy of Document Translation
dev.to·11h·
Discuss: DEV
✏️OCR Correction
Preview
Report Post
Yann LeCun’s VL-JEPA: The breakthrough that gives AI a "Mind's Eye" (instead of just a mouth).
hisohan.substack.com·10h·
Discuss: Substack
🔲Cellular Automata
Preview
Report Post
Stanford CS 224N | Natural Language Processing with Deep Learning
web.stanford.edu·1d
🧠Machine Learning
Preview
Report Post
Document Parsing with LLMs: From OCR to Structural Understanding.
alamedadev.com·3d
📋Document Grammar
Preview
Report Post
The Transformer Architecture: A Deep Dive into How LLMs Actually Work
dev.to·8h·
Discuss: DEV
📝Text Parsing
Preview
Report Post
Retrotechtacular: IBM’s The World of OCR
hackaday.com·1d
📄OCR
Preview
Report Post
Exploiting Similarities among Languages for Machine Translation
dev.to·2d·
Discuss: DEV
🔄Palindrome Detection
Preview
Report Post
SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance
arxiv.org·2d
📋Document Grammar
Preview
Report Post
How Is AI Upgrading the Legal Industry and Making Law Smarter?
open.forem.com·1d·
Discuss: DEV
🤖AI Curation
Preview
Report Post
SearchResearch (12/24/25): Living in an AI world that kinda, sorta works for OCR
searchresearch1.blogspot.com·3d·
👁️OCR Evolution
Preview
Report Post
Building an AI Document Processing Pipeline on AWS (Textract + Bedrock)
dev.to·15h·
Discuss: DEV
🔄Archival Workflows
Preview
Report Post
Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition
dev.to·15h·
Discuss: DEV
🤖Advanced OCR
Preview
Report Post
TRUNAJOD: A text complexity library for text analysis built on spaCy — TRUNAJOD 0.1.1 documentation
trunajod20.readthedocs.io·13h
📝Parsing Grammars
Preview
Report Post
wwes4/AI_Accel_1.5x: AI acceleration framework for ~1.5x speedups in mid-sized models via tension-based pruning. Built utilizing xAI's Grok.
github.com·1d·
Discuss: Hacker News
📊Quantization
Preview
Report Post
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
dev.to·4h·
Discuss: DEV
🤖Manuscript AI
Preview
Report Post
Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models
dev.to·1d·
Discuss: DEV
🧮Vector Embeddings
Preview
Report Post
I built an AI app for deep research, reverse image search, and price comparison
apps.apple.com·3d·
Discuss: Hacker News
🤖AI Curation
Preview
Report Post
The 2025 Guide to Machine Learning
ibm.com·1d·
Discuss: Hacker News
🧠Machine Learning
Preview
Report Post